The principal research question for this project, is to determine whether or not the 421a Tax Abatement, designed to stimulate housing development in New York City, became a driver in the decrease of tha city’s Black population. Over the years a variety of news outlets, as well as casual observations have all seemed to support the commonly accepted idea that the Black population in NYC has decreased.
In 2012, the title of an article in one of NYC’s local newspaper’s, reads “Black population surges in East New York as it falls across the borough and city” 1. The article goes on to detail the population losses, primarily in Brooklyn, 2000-2010. Also, very recent data that shows, Brooklyn, New York’s overall population has decreased from 2016 to 2017, the first annual drop since 2010 2.
However, a phrase from the above article, captures the central mission of our research study. The article states “Whether gentrification leads directly to displacement is a separate question…” - this is precisely the question we’re aiming to answer. While there’s no agreement on what defines gentrification, or at least its measurable indicators (housing costs, income, etc), this research study takes another angle. This study operationalizes the concept of gentrification, by a single policy initiative - NYC’s official count of 421a tax abated units. The research question becomes more specific, has New York City’s 421a Tax Abatement policy, caused the displacement of Blacks and others in New York City?. This is a very different question that asking whether the symptoms of gentrification, such as higher housing costs or a decrease in manufacturing jobs, have caused population displacement. This research question seeks to quantify the causal relationship between policy and outcomes, if any.
The second policy consideration is to determine whether or not, New York Police Departments “Stop & Frisk” practice, during its peak years, was also a causal factor in displacement vis a vis being enforced in areas with the highest amount of 421 tax abated housing.
Finally, a supplemental component of our study, was to perform the same quantitative analysis used to measure the impacts on Blacks, also on the Puerto Rican subset of the Latino community.
The American Community Survey (ACS) data so far has actually shown that the compound annual growth rate (CAGR) is positive for the overall population of NYC (Overall), as well as the African-American/Black (Black), White & Latino racial/ethnic groupings in New York City from 2005-2016 3. However, of the five separate racial/ethnic groupings in our analysis - Puerto Rican was the only one to see a negative CAGR from 2005-2016.
Tallies for 421a abated units, as well as location, were provided by the New York City Department of Finance4. Data is available from 2013-2018, severely limiting the scope of our analysis. However, it’s important to note that data in each year is cumulative from all previous years, minus units whose 421a tax abatement had expired.
Finally, NYPD Stop & Frisk data spans 2003-2016 5. It’s important to note that policy changes in 2012, essentially ended this practice in New York City. However,the question still remains, was Stop & Frisk effective tool in population displacement during its peak year 2006-2011, by being enforced in the areas with newly built some tax abated housing?
Although the population in New York City, is more white, the Black and Latino populations have also grown. It’s the Puerto Rican community, that best demonstrates the link between the 421a Tax Abatement policy and displacement given its population losses, and portion of 421a units in areas dominated by this ethnic group.
The 421a tax abated unit counts, are “as of” or cumulative for each year that it is provided. The analysis and visualization provided below, are all tax abated units in NYC as of fiscal year 2017-18.
Of the 67,007 tax abated units that exist in NYC as of fiscal year 2017-18, 47% or 31,776 are Brooklyn. Queens follows with 25% or 16,645. Further when Williamsburg North, South, East and Central are combined - it represents 7,916 tax abated units - the largest in the entire city, followed by Flushing Queens at 4,186.
The chart below provides a visualization of NYC’s overall population growth by race, spanning 2005-2016, as estimated by ACS annually. Although the CAGR for Blacks is .01, and for whites it’s .01, visually it appears that instead of Blacks being negatively impacted by Whites - the opposite is true.
In fact, our Pearson’s R measurement yields a .97 between the two groups, indicating an almost perfectly positive linear relationship.
However, the Puerto Rican trend is negatively correlated with the overall population (-.64) and all Latinos (-.91).
| Overall | White | Black | Other Latino | Puerto Rican |
|---|---|---|---|---|
| 0.01 | 0.02 | 0.01 | 0.02 | -0.01 |
## nycblkdf.Overall.Population
## nycblkdf.Overall.Population 1.00
## nycblkdf.White.Population 0.66
## nycblkdf.Black.Population 0.70
## nycprdf.Puerto.Rican.Population -0.64
## nycprdf.Latino.Population 0.84
## nycblkdf.White.Population
## nycblkdf.Overall.Population 0.66
## nycblkdf.White.Population 1.00
## nycblkdf.Black.Population 0.97
## nycprdf.Puerto.Rican.Population -0.42
## nycprdf.Latino.Population 0.63
## nycblkdf.Black.Population
## nycblkdf.Overall.Population 0.70
## nycblkdf.White.Population 0.97
## nycblkdf.Black.Population 1.00
## nycprdf.Puerto.Rican.Population -0.35
## nycprdf.Latino.Population 0.59
## nycprdf.Puerto.Rican.Population
## nycblkdf.Overall.Population -0.64
## nycblkdf.White.Population -0.42
## nycblkdf.Black.Population -0.35
## nycprdf.Puerto.Rican.Population 1.00
## nycprdf.Latino.Population -0.91
## nycprdf.Latino.Population
## nycblkdf.Overall.Population 0.84
## nycblkdf.White.Population 0.63
## nycblkdf.Black.Population 0.59
## nycprdf.Puerto.Rican.Population -0.91
## nycprdf.Latino.Population 1.00
Given the descriptive statistics and correlation analyses above, the next step is to determine whether or not we can have statistical confidence in the directional measures that have been computed thus far.
Although step-wise regression modeling is useful, given the small number of variables and observations, manually variable selection is also appropriate for this use case.
Given that our data is numeric and continuous, a linear regression model is the right choice to start investigating relationships between variables.
See the Appendix A for summary and data distributions
The initial regression (race_regression) has the overall NYC population trend (Overall.Population) as the dependent variable, while the trends for Black, White, Latino & Puerto Rican are independent variables. This model, risks multi-collinearity because the Puerto Rican trend is a subset of the Latino trend. This model only has two variables that are statistically significant at a 90% confidence interval (CI) - Black (p = .08) and Latino (p = .05). Each of these groups have a positive coefficient with overall population growth - consistent with the results in the preliminary correlation matrix. It’s also worth noting, that the White trend has a negative correlation with the overall population trend, although not statistically significant at a 90% CI, with a p-value = .11. The overall R-squared for this model is .87.
It’s important to note that the signs of our coefficients are not related to the growth rate, but rather the relationship between each independent variable and the specified dependent variable
In race_regressionA, the Puerto Rican trend variable is removed, while the overall population trend variable remains as our dependent variable - the R-squared is .86. All explanatory variables are statistically significant at 90% CI or higher - Black (p = .03), White (p = .06) & Latino (p = .01). Again, Black & Latino have a positive slope coefficient. The White trend has a negative slope coefficient, and in this model it’s statistically significant at a 90% CI.
Essentially the white population growth rate trends differently from the overall population growth on NYC, as well as some other groups.
Given that our research question places NYC’s Black population as the subject, and alternatively Puerto Ricans, the last two regression models have these groups as dependent variables.
The model, race_regressionD, has Black as the dependent variable - with Overall, White & Puerto Rican as the independent variables. At a 90% CI or higher, all of the other independent variables have positive slope coefficient - Overall (p = .04), White (p = .00) and Puerto Rican (p = .06).
In model race_regression E, with Puerto Rican as the dependent variable, Black is the only variable with a positive slope coefficient that is statistically significant at 90% CI (p = .06). The other two variables, Overall & White have a negative slope coefficient and each of them is statistically significant at 90% CI or higher Overall (p = .02), and White (p = .06).
##
## Call:
## lm(formula = nycblkdf$Overall.Population ~ nycblkdf$Black.Population +
## nycblkdf$White.Population + nycprdf$Puerto.Rican.Population +
## nycprdf$Latino.Population)
##
## Residuals:
## Min 1Q Median 3Q Max
## -93418 -37472 -18847 35125 109409
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -2.022e+06 3.074e+06 -0.658 0.5317
## nycblkdf$Black.Population 2.945e+00 1.467e+00 2.008 0.0846 .
## nycblkdf$White.Population -4.802e-01 2.598e-01 -1.848 0.1070
## nycprdf$Puerto.Rican.Population 1.480e+00 1.882e+00 0.786 0.4576
## nycprdf$Latino.Population 2.073e+00 8.826e-01 2.349 0.0511 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 76070 on 7 degrees of freedom
## Multiple R-squared: 0.8706, Adjusted R-squared: 0.7966
## F-statistic: 11.77 on 4 and 7 DF, p-value: 0.003158
##
## Call:
## lm(formula = nycblkdf$Overall.Population ~ nycblkdf$Black.Population +
## nycblkdf$White.Population + nycprdf$Latino.Population)
##
## Residuals:
## Min 1Q Median 3Q Max
## -65309 -48072 -27043 26333 122358
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -2.289e+05 2.011e+06 -0.114 0.91219
## nycblkdf$Black.Population 3.449e+00 1.287e+00 2.680 0.02792 *
## nycblkdf$White.Population -5.361e-01 2.438e-01 -2.199 0.05913 .
## nycprdf$Latino.Population 1.430e+00 3.211e-01 4.452 0.00213 **
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 74230 on 8 degrees of freedom
## Multiple R-squared: 0.8591, Adjusted R-squared: 0.8063
## F-statistic: 16.26 on 3 and 8 DF, p-value: 0.0009129
##
## Call:
## lm(formula = nycblkdf$Overall.Population ~ nycblkdf$Black.Population +
## nycblkdf$White.Population + nycprdf$Puerto.Rican.Population)
##
## Residuals:
## Min 1Q Median 3Q Max
## -141660 -32653 -12828 31053 117752
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 3.694e+06 2.350e+06 1.571 0.1547
## nycblkdf$Black.Population 4.200e+00 1.709e+00 2.458 0.0394 *
## nycblkdf$White.Population -5.685e-01 3.216e-01 -1.768 0.1150
## nycprdf$Puerto.Rican.Population -2.623e+00 8.778e-01 -2.988 0.0174 *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 95160 on 8 degrees of freedom
## Multiple R-squared: 0.7685, Adjusted R-squared: 0.6817
## F-statistic: 8.852 on 3 and 8 DF, p-value: 0.006379
##
## Call:
## lm(formula = nycblkdf$Overall.Population ~ nycprdf$Puerto.Rican.Population)
##
## Residuals:
## Min 1Q Median 3Q Max
## -254270 -40167 20834 75967 167566
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 1.048e+07 8.088e+05 12.962 1.41e-07 ***
## nycprdf$Puerto.Rican.Population -2.888e+00 1.083e+00 -2.666 0.0236 *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 135200 on 10 degrees of freedom
## Multiple R-squared: 0.4155, Adjusted R-squared: 0.3571
## F-statistic: 7.11 on 1 and 10 DF, p-value: 0.02364
##
## Call:
## lm(formula = nycblkdf$Black.Population ~ nycblkdf$Overall.Population +
## nycblkdf$White.Population + nycprdf$Puerto.Rican.Population)
##
## Residuals:
## Min 1Q Median 3Q Max
## -20175 -6536 -1158 8386 25667
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 3.602e+05 4.002e+05 0.900 0.3944
## nycblkdf$Overall.Population 1.024e-01 4.168e-02 2.458 0.0394 *
## nycblkdf$White.Population 1.621e-01 1.490e-02 10.878 4.51e-06 ***
## nycprdf$Puerto.Rican.Population 3.525e-01 1.557e-01 2.264 0.0534 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 14860 on 8 degrees of freedom
## Multiple R-squared: 0.9693, Adjusted R-squared: 0.9578
## F-statistic: 84.29 on 3 and 8 DF, p-value: 2.15e-06
##
## Call:
## lm(formula = nycprdf$Puerto.Rican.Population ~ nycblkdf$Black.Population +
## nycblkdf$Overall.Population + nycblkdf$White.Population)
##
## Residuals:
## Min 1Q Median 3Q Max
## -30160 -11270 -3264 7090 50026
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 7.852e+05 6.908e+05 1.137 0.2886
## nycblkdf$Black.Population 1.108e+00 4.893e-01 2.264 0.0534 .
## nycblkdf$Overall.Population -2.011e-01 6.729e-02 -2.988 0.0174 *
## nycblkdf$White.Population -1.797e-01 8.359e-02 -2.150 0.0638 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 26350 on 8 degrees of freedom
## Multiple R-squared: 0.6438, Adjusted R-squared: 0.5102
## F-statistic: 4.82 on 3 and 8 DF, p-value: 0.03347
ACS: NYC RACE BY YEAR & BOROUGH
ACS: NYC BLACKS & PUERTO RICANS BY YEAR & TOP NEIGHBORHOOD
Based on our descriptive data, visualization and multiple regression analysis we’re able to determine that there a statistically significant negative and positive relationships between the growth rates and trends of several racial/ethnic groups in New York City.
It seems that the White population growth trend is only positive correlated with the Black growth trend, but has an inverse relationship with the overall population and Puerto Ricans. It’s possible that gentrification by whites adversely impacted historically Puerto Rican neighborhoods, while having no net impact on Blacks. It’s also interesting that although Blacks and Whites have near perfect linear correlation, positive CAGR and positive regression slope coefficients with each other - that their relationship to the Puerto Rican growth trend is opposite that of each other (see race_regressonD & race_regressionE).
Without more historical data for NYC 421a tax abatement program, very little can be done to link it to growth rates of different racial/ethnic groups within NYC or to the NYPD’s policy of Stop & Frisk from 2003-2011. However, what we have been to do is learn a few key insights concerning our research question. First, it’s futile to try and determine the causes of NYC’s Black population decrease, because based on the ACS data - it’s actually increasing. Perhaps a new question should be asked, one that isolates African-Americans, as opposed to including immigrant groups which may masks net losses among African-Americans. So far, we have learned that the White growth rate is negatively related to the overall and Latino growth rate of NYC.
Finally, we discovered that the Puerto Rican subset within the Latino group is actually experiencing negative growth, which seems to be conversely related to the growth of whites and other Latinos. It could be that whites are directly impacting the population decrease among Puerto Ricans, while other Latino groups are increasign inhabiting non-gentrified areas of NYC.
We’ve established where tax abated units have been most constructed, and we’ve established negative population growth for Puerto Ricans. However, without historical data for the 421a tax abatement we cannot establish a statistically significant link between the two.
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 7956000 8237000 8350000 8329000 8427000 8550000
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 2746000 3440000 3634000 3475000 3652000 3912000
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 1894000 2023000 2057000 2040000 2075000 2140000
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 2223000 2303000 2361000 2368000 2441000 2492000
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 686800 719800 746300 745900 783300 788600